A New Approach for Hindi Optical Character Recognition Based On Neural Networks
نویسنده
چکیده
Assistant Professor, HIET, Kaithal (Haryana) E-mail: [email protected] 2 Assistant Professor, NIT, Kurukshetra (Haryana) E-mail: [email protected], Assistant Professor, HIET, Kaithal (Haryana) E-mail: [email protected] Assistant Professor, HCTM, Kaithal (Haryana) E-mail: [email protected] Abstract —OCR is the acronym for Optical Character Recognition. This technology allows a machine to automatically recognize characters through an optical mechanism. Human beings recognize many objects in this manner our eyes are the "optical mechanism. Development of OCRs for Indian script is an active area of activity today. Optical character recognition (OCR) is the mechanical or electronic translation of images of handwritten, typewritten or printed text (usually captured by a scanner) into machine-editable text. In simple words OCR is a visual recognition process that turns printed or written text into an electronic character based file. OCR is a field of research in pattern recognition, artificial intelligence and machine vision. Though academic research in the field continues, the focus on OCR has shifted to implementation of proven techniques. A lot of work had been carried out for OCR at international scenario but in Indian context a concrete approach for character recognition is still required as scripts of Indian languages are from the group of most complex scripts and it is very hard to recognize them. Indian scripts present great challenges to an OCR designer due to the large number of letters in the alphabet, the sophisticated ways in which they combine, and the complicated graphemes they result in. The problem is compounded by the unstructured manner in which popular fonts are designed. There is a lot of common structure in the different Indian scripts. All existing OCR systems developed for various Indian scripts do not provide sufficient efficiency due to various factors. The objective of this paper is to discuss a more efficient character recognition technique. This paper introduces a new technical approach to recognize Indian script characters which are unpredictable due to different problems in other OCR’s.
منابع مشابه
Implementation of Feed-forward Neural Network Models for Pattern Classification Using Transformation Based Feature Extraction Methods
Automatic recognition of handwritten Hindi characters is a difficult and one of the most interesting research areas of pattern recognition field. A lot of work has been done in this area till date; still it is a subject of active research. Hindi characters are cursive in nature and thus characters may be written in various cursive ways. Characters also show a lot of similar features such as hea...
متن کاملRecognition of Handwritten Hindi Characters using Backpropagation Neural Network
Automatic recognition of handwritten characters is a difficult task because characters are written in various curved & cursive ways, so they could be of different sizes, orientation, thickness, format and dimension. An offline handwritten Hindi character recognition system using neural network is presented in this paper. Neural networks are good at recognizing handwritten characters as these ne...
متن کاملHindi Numeral Recognition using Neural Network
Handwriting has continued to persist as a means of communication and recording information in day-to-day life even with the introduction of new technologies. The constant development of computer tools lead to the requirement of easier interface between the man and the computer. Handwritten character recognition may for instance be applied to Zip-Code recognition, automatic printed form acquisit...
متن کاملOptical Character Recognition for Hindi Language Using a Neural-network Approach
Hindi is the most widely spoken language in India, with more than 300 million speakers. As there is no separation between the characters of texts written in Hindi as there is in English, the Optical Character Recognition (OCR) systems developed for the Hindi language carry a very poor recognition rate. In this paper we propose an OCR for printed Hindi text in Devanagari script, using Artificial...
متن کاملA Novel Transfer Learning Approach upon Hindi, Arabic, and Bangla Numerals using Convolutional Neural Networks
Increased accuracy in predictive models for handwritten character recognition will open up new frontiers for optical character recognition. Major drawbacks of predictive machine learning models are headed by the elongated training time taken by some models, and the requirement that training and test data be in the same feature space and consist of the same distribution. In this study, these obs...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011